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Abstract. Using a calculus of variations approach, we determine the shape of 
a typical plane partition in a large box (i.e., a plane partition chosen at random 
according to the uniform distribution on all plane partitions whose solid Young 
diagrams fit inside the box). Equivalently, we describe the distribution of the 
three different orientations of lozenges in a random lozenge tiling of a large 
hexagon. We prove a generalization of the classical formula of MacMahon for 
the number of plane partitions in a box; for each of the possible ways in which 
the tilings of a region can behave when restricted to certain lines, our formula 
tells the number of tilings that behave in that way. When we take a suitable 
limit, this formula gives us a functional which we must maximize to determine 
the asymptotic behavior of a plane partition in a box. Once the variational 
problem has been set up, we analyze it using a modification of the methods 
employed by Logan and Shepp and by Vershik and Kerov in their studies of 
random Young tableaux. 
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1. Introduction 

In this paper we will show that almost all plane partitions that are constrained 
to lie within an a x b x c box have a certain approximate shape, if a, b, and c are 
large; moreover, this limiting shape depends only on the mutual ratios of a, 6, and 
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Figure 1. The tiling corresponding to a plane partition. 

c. Our proof will make use of the equivalence between such plane partitions and 
tilings of hexagons by rhombuses. 

Recall that plane partitions are a two-dimensional generalization of ordinary 
partitions. A plane partition n is a collection of non-negative integers n x ,y indexed 
by ordered pairs of non-negative integers x and y, such that only finitely many 
of the integers Tr x , y are non-zero, and for all x and y we have n x +i,y 5~ n x,y and 
n x,y+i < Kx.y A more symmetrical way of looking at a plane partition is to 
examine the union of the unit cubes [i, i + 1] x \j, j + 1] x [k, k + 1] with i, j, and k 
non-negative integers satisfying < k < 7Tj j. This region is called the solid Young 
diagram associated with the plane partition, and its volume is the sum of the Ttx^'s. 

We say that a plane partition tt fits within an a x 6 x c box if its solid Young 
diagram fits inside the rectangular box [0, a] x [0, b] x [0, c] , or equivalently, if ix x>y < c 
for all x and y, and -K x , y = whenever x > a or y > b; we call such a plane 
partition a boxed plane partition. Plane partitions in an a x b x c box are in 
one-to-one correspondence with tilings of an equi-angular hexagon of side lengths 
a, 6, c, a, 6, c by rhombuses whose sides have length 1 and whose angles measure ir/3 
and 2tt/3. It is not hard to write down a bijection between the plane partitions and 
the tilings, but the correspondence is best understood informally, as follows. The 
faces of the unit cubes that constitute the solid Young diagram are unit squares. 
Imagine now that we augment the solid Young diagram by adjoining the three 
"lower walls" of the a x b x c box that contains it (namely {0} x [0,6] x [0, c], 
[0, a] x {0} x [0, c], and [0, a] x [0,6] x {0}); imagine as well that each of these 
walls is divided into unit squares. If we look at this augmented Young diagram 
from a point on the line x = y = z, certain of the unit squares are visible (that 
is, unobstructed by cubes). These unit squares form a surface whose boundary 
is the non-planar hexagon whose vertices are the points (a, 0,c), (0,0, c), (0, b, c), 
(0,6,0), (a, 6,0), (a, 0,0), and (a,0, c), respectively. Moreover, these same unit 
squares, projected onto the plane x + y + z — and scaled, become rhombuses 
which tile the aforementioned planar hexagon. For example, the plane partition 7r 
in a 2 x 2 x 2 box defined by 7r 0! o = 7r 0i i = ~K\ _o = 1 and 7:11 = corresponds to the 
tiling in Figure [l| where the points (2,0,0), (0,2,0), and (0,0,2) are at the lower 
left, extreme right, and upper left corners of the hexagon. (The shading is meant 
as an aid for three-dimensional visualization, and is not necessary mathematically. 
The unshaded rhombuses represent part of the walls.) 

We will use the term a, 6, c hexagon to refer to an equi-angular hexagon of side 
lengths a, 6, c, a, 6, c (in clockwise order, with the horizontal sides having length 
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b), and the term lozenge to refer to a unit rhombus with angles of 7r/3 and 2tt/3. 
We will focus, without loss of generality, on those lozenges whose major axes are 
vertical, which we call vertical lozenges. Although our method in this article is 
to reduce facts about plane partitions to facts about tilings, one can also go in 
the reverse direction. For example, one can see from the three-dimensional picture 
that in every lozenge tiling of an a, b, c hexagon, the number of vertical lozenges 
is exactly ac (with similar formulas for the other two orientations of lozenge); see 
|DT| for further discussion. 

A classical formula of MacMahon Jm| asserts that there are exactly 

a-l b-1 c-1 , _ 

i=Q j=0 fc=0 J 

plane partitions that fit in an a x b x c box, or (equivalently) lozenge tilings of an 



a, b, c hexagon. In Theorem 2/2 of this article, we give a generalization that counts 
lozenge tilings wit h pr escribed behavior on a given horizontal line. 

Using Theorem |2.2| , we will determine the shape of a typical plane partition in an 
axbxc box (Theorem |l.2| ) . Specifically, that theorem implies that if a, b, c are large, 
then the solid Young diagram of a random plane partition inanaxdxc box is almost 
certain to differ from a particular, "typical" shape by an amount that is negligible 
compared to abc (the total volume of the box) . Equivalently, the visible squares in 
the augmented Young diagram of the random boxed plane partition form a surface 
whose maximum deviation from a particular, typical surface is almost certain to be 
o(min(a, b, c)). Moreover, scaling a, 6, c by some factor causes the typical shape of 
this bounding surface to scale by the same factor. 

Before we say what the true state of affairs is, we invite the reader to come 
up with a guess for what this typical shape should be. One natural way to arrive 
at a guess is to consider the analogous problem for ordinary (rather than plane) 
partitions. If one considers all ordinary partitions that fit inside an a x 6 rectangle 
(in the sense that their Young diagrams fit inside [0, a] x [0, b)), then it is not hard 
to show that the typical boundary of the diagram is the line x/a + y/b= 1; that is, 
almost all such partitions have roughly triangular Young diagrams. (One way to 
prove this is to apply Stirling's formula to binomial coefficients and employ direct 
counting; another is to use probabilistic methods, aided by a verification that if we 
look at the boundary of the Young diagram of the partition as a lattice path, then 
the directions of different steps in the path are negatively correlated.) It therefore 
might seem plausible that the typical bounding surface for plane partitions would 
be a plane (except where it coincided with the sides of the box), as when a box 



is tilted on its corner and half- filled with fluid. However, Theorem 1 .1 shows that 
that is in fact not the case. 

To see what a typical boxed plane partition does look like, see Figure 0. This 



tiling was generated using the methods of [PW| and is truly random, to the extent 
that pseudo-random number generators can be trusted. Notice that near the corners 
of the hexagon, the lozenges are aligned with each other, while in the middle, 
lozenges of different orientations are mixed together. If the bounding surface of 
the Young diagram tended to be flat, then the central zone of mixed orientations 
would be an inscribed hexagon, and the densities of the three orientations of tiles 
would change discontinuously as one crossed into this central zone. In fact, what 




Figure 2. A random lozenge tiling of a 32,32,32 hexagon. 



one observes is that the central zone is roughly circular, and that the tile densities 
vary continuously except near the midpoints of the sides of the original hexagon. 

One can in theory use our results to obtain an explicit formula for the typical 
shape of the bounding surface, in which one specifies the distance from a point P 
on the surface to its image P' under projection onto the x + y + z — plane, as a 
function of P'\ however, the integral that turns up is quite messy (albeit evaluable 
in closed form), with the result that the explicit formula is extremely lengthy and 
unenlightening. Nevertheless, we can and do give a fairly simple formula for the 
tilt of the tangent plane at P as a function of the projection P', which would 
allow one to reconstruct the surface itself. In view of the correspondence between 
plane partitions and tilings, specifying the tilt of the tangent plane is equivalent 
to specifying the local densities of the three different orientations of lozenges for 
random tilings of an a, 6, c hexagon. 

Our result on local densities has as a particular corollary the assertion that, in 
an asymptotic sense, the zone of mixed orientation (defined as the region in which 
all three orientations of lozenge occur with positive density) is precisely the interior 
of the ellipse inscribed in the hexagon. This behavior is analogous to what has 
been proved concerning domino tilings of regions called Aztec diamonds (see [ JPS| 
and [CEP |); these are roughly square regions, and if one tiles them randomly with 
dominos (1x2 rectangles), then the zone of mixed orientation tends in the limit 
to the inscribed disk. However, the known results for Aztec diamonds are stronger 
than the corresponding best known results for hexagons (see Conjecture 6.2 in 
Section ||) . 
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To state our main theorem, we begin by setting up normalized coordinates. 
Suppose that we are dealing with an a, b, c hexagon, so that the side lengths are 
a, b, c, a, b, c in clockwise order (with the sides of length b horizontal). We let a, b, c 
tend to infinity together, in such a way that the three-term ratio a : b : c (i.e., 
element of the projective plane P 2 (K)) converges to a : (3 : 7 for some fixed positive 
numbers a, (3, 7. Say a + b + c = a(a + (3 + 7) for some scaling factor a. Then we 
choose re-scaled coordinates for the a,b,c hexagon so that its sides are a' = a/cr, 
b' = 6/cr, and c' = c/a (which by the hypothesis of our main theorem will be 
required to converge to a, [3, and 7 as a, b, and c get large). Note that a',b',c' 
in general are not integers. The origin of our coordinate system lies at the center 
of the hexagon. One can check that the sides of the hexagon lie on the lines 
y= ^(2x + b' + c'), y = ^(a' + c'), y= ^(-2x + a' + b'), y = ^{2x-b'-c'), 

y = ^(— a' — c'), and y = ^(—2x — a' — b'), and that the inscribed ellipse is 
described by 

3(a'+c') 2 x 2 -2V3(a'+2b'+c')(a'^c')xy+{{a'+2b'+c / ) 2 -4a'c')y 2 = 3a'b'c'{a'+b'+c'). 
Define E a jj^(x,y) to be the polynomial 

3a/37(a + /3 + 7)-(3(a+7) 2 x 2 -2V3(a + 2/3+7)(a-7)a;y+((a + 2/3+7) 2 -4a7)y 2 ), 

whose zero-set is the ellipse inscribed in the a, (3, 7 hexagon. Also define Q a j3,y( x , y) 
to be the polynomial 



Qy 2 - 4x 2 + f3 2 + ap + fa - 



which will be useful shortly. 

There are six points at which the ellipse inscribed in the a, /3, 7 hexagon meets 
the boundary of the hexagon. The four that occur on sides of length a or 7 will be 
called singular points, for reasons that will be clear shortly. (Recall that we have 
already broken the underlying symmetry of the situation by agreeing to focus on 
vertical lozenges.) The points of the hexagon that lie outside the inscribed ellipse 
form six connected components. Let R\ be the closure of the union of the two 
components containing the leftmost and rightmost points of the hexagon, minus 
the four singular points, and let Rq be the closure of the union of the other four 
components, again minus the four singular points. 

Finally, define the (normalized) coordinates of a vertical lozenge to be the (nor- 
malized) coordinates of its center. Then the following theorem holds: 

Theorem 1.1. Let U be the interior of a smooth simple closed curve contained 
inside the a, (3, 7 hexagon, with a, (3, 7 > 0. In a random tiling of an a, 6, c hexagon, 
as a, b, c — > 00 with a : b : c — > a : (3 : -f, the expected number of vertical lozenges 
whose normalized coordinates lie in U is ab + bc + ac times 

where A = (a[3 + f3j + aj)^/3/2 is the area of the a, (3, 7 hexagon and 



V a .j3 n (x,y) = < 



k \y/E ail3n (x,y) J 

if (a;, y) e Rq, and 

1 ]S(x,y)eRi. 
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In fact, our proof will give an even stronger version of Theorem |l.l| , in which 
U is a horizontal line segment rather than an open set (and the double integral is 
replaced by a single integral). Since we can derive the expected number of vertical 
lozenges whose normalized coordinates lie in U by integrating over all horizontal 
cross sections (provided that the error term is uniform for all cross sections, as 
we will prove in Section ^), this variant of the claim implies the one stated above, 
though it is not obviously implied by it. We have stated the result in terms of open 
sets because that formulation seems more natural; the proof just happens to tell us 
more. 

The intuition behind Theorem [0| is that f a ^ 7 (s, y) gives the density of vertical 
lozenges in the normalized vicinity of (x, y); the factor of ab + bc + ac arises simply 
because there are that many lozenges in a tiling of an a, 6, c hexagon. One might be 
tempted to go farther and think of V a ,p.y(x, y) as the asymptotic probability that 
a random tiling of the a, 6, c hexagon will have a vertical lo zeng e at any particular 
location in the normalized vicinity of (x, y) (see Conjecture 6A. in Section ||); how- 



ever, we cannot justify this interpretation rigorously, because it is conceivable that 
there are small-scale fluctuations in the probabilities that disappear in the double 



integral. (In Subsection 1.3 of | CEP |, it is shown that the analogous probabilities 
for random domino tilings do exhibit such fluctuations, although the fluctuations 
disappear if one distinguishes between four classes of dominos, rather than just 
horizontal and vertical dominos.) 

The formula for "P Qi /3. 7 is more natural than it might appear. Examination of 
random tilings such as the one shown in Figure || leads one to conjecture that 
the region in which all three orientations of lozenges occur with positive density is 
(asymptotically) the interior of the ellipse inscribed in the hexagon, and the known 
fact that an analogous claim holds for random domino tilings of Aztec diamonds 



(sec [CEP |) lends further support to this hypothesis. This leads one to predict that 
Tafi^ will be in Rq and 1 in R\, and strictly between and 1 in the interior of 
the ellipse. Comparison with the analogous theorem for domino tilings (Theorem 1 



of CEP]) suggests that within the inscribed ellipse, V a ^^(x^y) should be of the 
form 

l cot _J Q(x,y) \ 



for some quadratic polynomial Q(x,y), and the only problem is actually finding 
Q(x,y) in terms of a, (3, 7. A simple description of the polynomial Q(x,y) that 
actually works is that its zero-set is the unique hyperbola whose asymptotes are 
parallel to the non-horizontal sides of the hexagon and which goes through the 
four points on the boundary of R\ where the inscribed ellipse is tangent to the 
hexagon. This determines Q{x, y) up to a constant factor, and that constant factor 
is determined (at least in theory — in practice the calculations would be cumber- 
some) by requiring that the average of V a ^^(x^y) over the entire hexagon must 
be aj/(a/3 + j3j + cry). 



An alternative way to phrase Theorem 1.1 is in terms of height functions, which 



were introduced by Thurston in [ Th | as a geometrical tool for understanding tilings 
of regions by lozenges or dominos. A height function is a numerical representation 
of an individual tiling of a specified region. In the case of lozenge tilings of a 
hexagon, the vertices of the lozenges occur at points of a certain triangular lattice 
that is independent of the particular tiling chosen, and the height function simply 
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Figure 3. The height function corresponding to a tiling. 



associates a certain integer to each such vertex so as to describe the shape of the 
plane partition that corresponds to the tiling. Given any lozenge tiling, one can 
assign heights h(u) to the points u of the triangular lattice as follows. Give the 
leftmost vertex of the hexagon height a + c. (We choose this height so that the 
vertex of the bounding box farthest from the viewer, which is usually obscured from 
view, will have height 0.) Suppose that u and v are adjacent lattice points, such 
that the edge connecting them does not bisect a lozenge. If the edge from u to 
v points directly to the right, set h(v) = h(u) + 1, and if it points up and to the 
right, or down and to the right, set h(v) = h(u) — 1. (If it points left, change +1 to 
— 1 and vice versa.) If one traces around each lozenge in the tiling and follows this 
rule, then every vertex is assigned a height. It is not hard to check that the heights 
are well-defined, so there is a unique height function associated to the tiling. For 
an example, see Figure [|. Conversely, every way of assigning integer heights to the 
vertices of the triangular grid that assigns height a + c to the leftmost vertex of the 
hexagon and that satisfies the edge relation must be the height function associated 
to some unique tiling. If one views the tiling as a three-dimensional picture of the 
solid Young diagram of a plane partition, then the height function tells how far 
above a reference plane (of the form x + y + z = constant) each vertex lies. The 
values of the height function on the boundary of the hexagon are constrained, but 
all the values in the interior genuinely depend on the tiling. (It should be mentioned 
that height functions for lozenge tilings are implicit in the work of physicists Blote 
and Hilhorst, in the context of the two-dimensional dimer model on a hexagonal 



lattice; see [BH].) 

When dealing with normalized coordinates, it is convenient to use a normalized 
height function: if we scale the coordinates by dividing them by a (as described 
above), then we also divide the height function values by a. Also, we define the 
average height function to be the average of the height functions associated with 
all possible tilings. (Of course, it is not a height function itself.) We will show 
that almost every height function closely approximates the average height function, 
asymptotically : 

Theorem 1.2. Fix 7 > 0. As a, b, c — > 00 with a : b : c — > a : (3 : 7, the 

normalized average height function of the a,b,c hexagon converges uniformly to the 
function 7i Q ,/3, 7 with the appropriate (piecewise linear) boundary conditions such 



8 



HENRY COHN, MICHAEL LARSEN, AND JAMES PROPP 



that 

dH a ,[3,~f(x,y) 



1 - 3V a ,i3,-y(x,y)- 



dx 

For fixed e > 0, the probability that the normalized height function of a random 
tiling will differ anywhere by more than e from H. a ,0,-y is exponentially small in a , 
where a — (a + b + c)/ (a + (3 + 7). 

It is not hard to deduce Theorem |l.2| (with the exception of the claim made in 



the last sentence) from Theorem 1.1; in particular, the differential equation simply 
results from considering how the height changes as one crosses lozenges of each 
orientation. We will give the proof in detail in Section [|. 

Unfortunately, although one can recover an explicit formula for H. a ^^(x, y) from 
the boundary values and the knowledge of d7i a ,0,^(x, y) / dx, we cannot find any 



simple formula for it. By contrast, Proposition 17 of JCEP gives a comparatively 



simple asymptotic formula for the average height function for domino tilings of an 
Aztec diamond. 

Our methods also apply to the case of random domino tilings of Aztec diamonds. 



Formula (7) of Section 4 of | EKLP is analogous to our Theorem 2J2, and can be 
used in the same way. It turns out that the functional arrived at by applying the 
methods of Section [| of this paper to that formula is very similar to the one we will 
find in Section |[ After a simple change of variables, one ends up with essentially 
the same functional, but maximized over a slightly different class of functions. The 
methods of Section |] apply almost without chang e, and the methods of Section || 
can be adapt ed to prove Proposi tion 1 7 of |CEP |. This proof is shorter than the 



one given m pEPl ; however, in |CEP| , Proposition 17 comes as a corollary of a 



much stronger result (Theorem 1), which the methods of this paper do not prove. 

2. The Product Formula 
In this section, we will prove a refinement of MacMahon's formula, following 



methods first used by Elkies et al. in [EKLP]. This refinement (Proposition EJJ) is 



not strictly speaking new, since it is really nothing more than the Weyl dimension 
formula for finite-dimensional representations of SL(n) (we say a few words more 
about this connection below). However, we give our own proof of this result for two 
reasons: first, to make this part of the proof self-contained; and second, to illustrate 



an expeditious method of proof that has found applications elsewhere (see [ IP for 
related formul as d erived by the same method). 



Proposition |2.1| is stated in terms of Gelfand patterns, so we must first explain 
what Gelfand patterns are and what they have to do with lozenge tilings. It is 
not hard to see that a lozenge tiling of a hexagon is determined by the locations 
of its vertical lozenges. A semi-strict Gelfand pattern is a way to keep track of 
these locations. Specifically, one augments the a, b, c hexagon by adding a(a + l)/2 
vertical lozenges on the left and c(c + l)/2 vertical lozenges on the right, forming 
an approximate trapezoid of upper base a + b + c and lower base b, with some 
triangular protrusions along its upper border, as in Figure ^. One then associates 
with each vertical lozenge in the tiling the horizontal distance from its right corner 
to the left border of the trapezoid, which we call its trapezoidal position. (When 
we want to use the left boundary of the hexagon instead of the left boundary 
of the approximate trapezoid as our bench mark, we will speak of the hexagonal 
position of a vertical lozenge.) For example, consider the tiling shown in Figure [IJ 
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Figure 4. The semi-strict Gelfand pattern corresponding to a tiling. 

we augment it by adding 6 vertical lozenges to form the tiling shown in Figure |J 
The vertical lozenges form rows, and the only restriction on their placement is that 
given any two adjacent vertical lozenges in the same row, there must be exactly one 
vertical lozenge between them in the row immediately beneath them. If we index 
the locations of the vertical lozenges with their trapezoidal positions (the numbers 
shown in Figure ^) , we arrive at the following semi-strict Gelfand pattern: 

12 5 6 
1 2 5 
1 4 
2 

In general, a semi-strict Gelfand pattern is a triangular array of integers (such as 
this one) , with the property that the entries increase weakly along diagonals running 
down and to the right, and the entries increase strictly along diagonals running up 
and to the right. As discussed above, there is a simple bijection between semi-strict 
Gelfand patterns with top row 1, 2, . . . , a, a + b+l, a + 6 + 2, . . . , a + b + c and lozenge 
tilings of an equi-angular hexagon with side lengths a, 6, c, a, 6, c. 

Moreover, consider the fc-th horizontal line from the top in an a, b, c hexagon, 
where the top edge of the hexagon corresponds to k — 0, so that the trapezoidal 
positions of the vertical lozenges on the fc-th line are precisely the entries in the 
(fc+l)-st row of the associated semi-strict Gelfand pattern. If we discard all lozenges 
that lie above the line (but retain all vertical lozenges that are bisected by it and all 
lozenges that lie below it), then we get a tiling of a smaller approximate trapezoid, 
whose upper border as before consists of triangular protrusions alternating with 
straight edges, except that now the protrusions need not be concentrated at the 
left and right portions of the border. The trapezoidal positions of the vertical 
lozenges in this tiling are given by the entries in the semi-strict Gelfand pattern 
obtained from the original semi-strict Gelfand pattern by deleting the first k rows. 
In fact, if we limit ourselves to tilings of the a, 6, c hexagon in which the locations 
of the vertical lozenges that are bisected by the fc-th horizontal line are specified, 
then each individual tiling of this kind corresponds to a pair of semi-strict Gelfand 
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patterns. We have already described one of these Gelfand patterns, which gives the 
behavior of the tiling below the cutting line; the other, which describes the tilings 
above the line, comes from reflecting the hexagon through the horizontal axis (and 
of course adjoining additional lozenges, as above). If < k < min(a,c), then 
one of the Gelfand patterns will include some of the augmenting vertical lozenges 
described above on both sides (a row of length a — k on one side of the top of the 
pattern, and a row of length c — k on the other) and the other pattern will contain 
augmenting vertical lozenges on neither side; if min(a, c) < k < (a + c)/2, then 
one of the Gelfand patterns will contain a row of \a — k\ augmenting lozenges on 
one side and the other Gelfand pattern will contain | c — A; | on the other side. (The 
case k > (a + c)/2 is symmetric to the case k < (a + c)/2, so we do not treat it 
explicitly.) 



In Theorem 2.2, we will use the following formula to determine how many tilings 



have a specified distribution of vertical lozenges along a horizontal line. 
Proposition 2.1. There are exactly 



n 



7 — l 
l<i<j<n J 

semi-strict Gelfand patterns with top row consisting of integers x\, x%, . . . , x n such 
that xi < X2 < ■ ■ ■ < x n . 



Proposition 2.1 has an explanation in terms of representation theory. Gelfand 
and Tsetlin ]GT| show that semi-strict Gelfand patterns form bases of representa- 
tions of SL(n), and one can deduce Propositi on | 2.1 | from this fact using the Weyl 
dimension formula. (The Gelfand patterns in |GT| are not semi-strict, but the re is 
an easy transformation that makes them so: Replace m PA in equation (3) of [ |GT| 
with rrip q + (q — p) and then reflect the triangle through its vertical axis.) Thus, 
the novelty of our approach is not that one can count semi-strict Gelfand patterns, 
but rather that one can count tilings with prescribed behavior on a horizontal line 



(as in Theorem [2.2] ). Another proof of Proposition 2T, and one that byp asse s 



its representation-theoretic significance, is the article of Carlitz and Stanley [CS| 



(That article does not deal directly with semi-strict Gelfand patterns, but it is easy 



to deduce Proposition 2.1 from the theorem proved there.) 



Proof. Let V(x\, . . . ,x n ) be the number of semi-strict Gelfand patterns with top 
row Xx,...,x n . Given any such pattern, the second row must be of the form 
j/i, ... , 2/n-i with Xi < yi < Xi+i for all i. Therefore, 

(2.1) v(x 1 ,..., Xn ) = XT XT--- XT 

Vi=xiy 2 =x 2 y„,-i=x„-i 

where the modified summation operator ^ L is defined by 

XT m = tm 

i—s i—s 

The advantage to writing it this way is that 

(2.2) E'/w + E'/w-E'/w 
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whenever r < s < t. There is a unique way to extend the definition of 

i—s 

to the case s > t, if we require that (E72J) be true for all r, s, t. Then starting from 



the base relation V(x\) = 1, we can use (2.1) to define V(x%, . . . ,x n ) for arbitrary 



integers x\, . . . ,x n (not necessarily satisfying x\ < • • • < x n ). 

We will prove the formula for V{x\, . . . , x n ) by induction on n. It is clearly true 
for — Suppose that for all y\, . . . , y n -i, 



(2.3) V(yi,...,y n -i)= [] 



i — i 

l<i<j<n-l J 



Then V(yi, ■ ■ . , y n -i) is a polynomial of total degree (n— l)(n— 2)/2 in y%, . . . , y n -i- 



When we put (|2.3|) into (2.1), we find that after each of the n summations in (2.1), 



the result is still a polynomial, and the degree increases by 1 each time. It is easy 



to check from (2.3) that the coefficient of y\y\ ■ ■ ■ y^-\ m V(yi,..., y n -x) is 

1 



l!2!...(n-2)!' 

From this and (2A) it follows that the coefficient of x\x\ . . . x™ -1 in V(xi, . . . , x n ) 



is 

1 



112!. ..(n- 1)! 
We have therefore shown that V(x\, . . . , x n ) and 



( 2 - 4 ) n 



7 — * 
l<i<j<n J 

are polynomials in Xi, . . . , x n of the same total degree, and with the same coefficient 
of 1 . If we can show that V(x±, . . . , x n ) is divisible by ( ^.4[ ), then they 

must be equal. Equivalently, we just need to show that V{x\, . . . , x n ) is skew- 
symmetric in x\, . . . , x n . 

For convenience, let (si,ti; S2, t%\ . . . ; s n _i,t„_i) denote 

H ■■■ ^(yi,y2,---,yn-i)- 

!/l=Sl V2 = S2 Vn-l=Sn-l 

We want to show that (x\, X2]X2,x^] . . . ; x n _i, x„) is skew-symmetric in x%, . . . , x n . 
To start off, notice that for all i, 

(2.5) (...; Si,Ui; ...) = (...; Si,ti; ...) + (... ;ti,Ui; .. .). 



Also, since V(yi, . . . , y n -i) is a skew-symmetric function of yx, . . . , y n _i by (2.3), 
for i < j we must have 

(• • • ; Sj, tj ; . . . ; Sj , tj ; . . . ) — (. . . ; Sj , tj ; . . . ; Sj , ; . . . ) . 

The relation (. . . ; Sj, t^; . . . ) = —(...; fj, sf, ... ) follows easily from the definition of 
(. . . ; Si, U; . . .). From the last two relations, we see that (. . . ; Sj, tj; . . . ; Sj, if, . . . ) 
vanishes if s, = Sj and t< = tj, or if s,; = tj and Sj = fj. 

To verify that (xi, X2', ■ ■ ■ ; x n _i, x„) is skew-symmetric in xi, . . . , x n , it suffices 
to check that it changes sign under transpositions of adjacent XiS. We check the 
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effect of exchanging Xi+y with Xi+ 2 as follows. If we write x' k — Xi+k (to simplify 
the subscripts), we have 

/ i i i i i i \ 

(. ■ . , Xq, x 2 \ x 2 , Xy, Xy, x 3 , . . . ) — 



(■■ 


/ 

• ) x 0i 


i . 

x l i 


1 

x 2i 


l 

x l'l 


1 

X ll 


1 

x 2i ■ 


•) 


+ (■■ 


■'i x Oi 


x 1 ; 


x 2i 


X \ j 


X 1l 


x 3i ■ 


•) 


+ (•• 


. ; Xy, 


2-2 ) 


x 2i 


x \ i 


X ll 


x 2 ; . 


•) 


+ (•• 


. ; Xy , 


x l'i 


x 2i 


X l) 


x 2l 


x 3' ■ 


•) 



by several applications of (|]^). All terms on the right except the second are 0, so 
(. . . ; x Q , x%; x 2 , Xy, Xy, x 3 ; . . . ) = (. . . ; x ,Xy',X2, Xy;x 2 ,x 3 ; . . .) 

(• • • ! x 0i x \'i x \i X 2 \ x 2 , x $', ■ ■ • )• 

Thus, exchanging x i+ i with x i+2 introduces a minus sign whenever i > 1 and 
i + 3 < n. The other cases (exchanging xy with x 2 or x n -y with easily dealt 

with in a similar way. Therefore, V(xy, . . . , x n ) is skew-symmetric in Xy, . . . , x n , so 
as discussed above we must have 



V(xy, . . . ,x n ) = Yl 



j — i 

l<i<j<n 

as desired. □ 

Notice that after some manipulation of the product 

V(l, 2,...,a,a + b + l,a + b + 2,...,a + b + c), 



Proposition 2.1 implies MacMahon's formula. However, our main application will 
be to counting tilings with prescribed behavior on horizontal lines. 

Consider the fc-th horizontal line from the top in an a, 6, c hexagon. If fc < 
min(a, c), then in every tiling there must be k vertical lozenges on the fc-th line; if 
min(a, c) < k < (a + c)/2, then there must be min(a, c) vertical lozenges on it. (As 
mentioned earlier, symmetry frees us from needing to treat the case fc > (a + c)/2, 
so we will routinely assume fc < (a + c)/2.) In either case, note that the number of 
vertical lozenges on the fc-th row is min(fc, a, c). 

Define the function 



V(xy, . . . ,x n ) = Yl 



7 

\<i<j<n 



as in the proof of Proposition 2.1. Then we have the following formulas: 

Theorem 2.2. Suppose we require that the vertical lozenges bisected by the k-th 
horizontal line from the top in an a, b, c hexagon occur at hexagonal positions 1 < 
ay < a 2 < ■ ■ ■ < at < b + min(fc,a) (and nowhere else), with I — min(fc,a, c). If 
k < min(a, c), there are 

V(ay, a 2 , ■ ■ ■ , ai)V(l, 2, . . . , a — fc,a — k + ay, . . . ,a—k + ai,a+b+l, . . . ,a + b+c— fc) 
such tilings. If a < c and a < fc < (a + c)/2, there are 

V(l, 2, . . . , k — a, k — a + ay, . . . , k— a-{-ae)V(ay, a 2 ,...,ai,a + b+l,... 1 a + b + c—k) 
such tilings (and a similar formula applies if c < a and c < k < (a + c)/2). 
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For the proof, simply notice that tilings of the parts of the hexagon above and 
below the fc-th line correspond naturally to semi-strict Gelfand patterns with certain 
top rows, and then apply Proposition 2.1 to count them. In both cases, the two 
factors correspond to the parts of the tiling that lie above and below the cutting 
line. 



3. Setting up the Functional 

We now turn to the proofs of our main theorems. As is usually done in situations 
such as ours, where one seeks to establish a law of large numbers for some class of 
combinatorial objects, we approach the problem by trying to find the individually 
most likely outcome (in this case, the individually most likely behavior of the height 
function on a fixed horizontal line) , and showing that outcomes that differ substan- 
tially from it are very unlikely, even considered in aggregate. We will begin in this 
section by setting up a functional to be maximized; the function that maximizes it 
will be a simple transformation of the average height function. 

Our method is to focus on the locations of the vertical lozenges rather than the 
height function per se. The two are intimately related, because, as we move across 
the tiling from left to right, the (unnormalized) height decreases by 2 when we cross 
a vertical lozenge and increases by 1 when we fail to cross a vertical lozenge. Thus, 
in determining the likely locations of vertical lozenges, we will in effect be solving 



for the average height function. Theorem 2.2 tells us that we can count tilings with 



prescribed behavior on horizontal lines, so we will start off by taking the logarithm 



of the product formula in Theorem 2.2 and interpreting it as a Riemann sum for a 
double integral. 

In fact, it will be convenient to look first at a more general product, and then 



apply our analysis of it to the product appearing in Theorem 2.2. Fix positive 
integers £ and n satisfying t < n, and non-negative integers til and ur. We will 
try to determine the distribution of (<zi, 02, ... , ai) satisfying 1 < a\ < a 2 < • ■ ■ < 
a,£ < n that maximizes 

V(l, 2, . . . , n L ,n L + ai, n L + a 2 , n L + a e ,n L + n + l, n L + n + 2, ... , n L + n+nii). 

For convenience, let 6j denote the i-th element of the sequence 1, 2, . . . , nj,, nj, + 
ai, . . . ,n L + a e ,n L + n + 1, . . . ,n L + n + n R . 

Set pl — TiL,/n, A — i/n, and pr = n R /n. We will work in the limit as n — > 00, 
with pl, A, and pR tending toward definite limits. For a more convenient way 
to keep track of 01, . . . , at as we pass to the continuum limit, we define a weakly 
increasing function A : [0, 1] — * [0, 1] as follows. Set A(i/n) = #{j : Oj < 1} /n for 
< i < n, and then interpolate by straight lines between these points. Similarly, 
set B(i/N) = #{j : b 3 < i}/N for < i < N, where N d = n L + n + n R , and then 
interpolate by straight lines. To simplify the notation later, set p = pl + Pr, so 
that N = (1 + p)n, A(0) = 0, A(l) = A, B{0) = 0, and 5(1) = (A + p)/(l + p). 

The functions A and B satisfy the Lipschitz condition with constant 1; that is, 
\A(s) - A(t)\ and \B(s) - B{t)\ are bounded by \s — t\. Note that the derivatives A' 
and B' are not defined everywhere, but they are undefined only at isolated points, 
and where they are defined they equal either or 1; when we make statements 
about A' and B' , we will typically ignore the points of non-differentiability. We can 
also derive a simple relation between A' and B' . To do so, notice that for < i < n, 
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we have 

B f i + n L \ = #{j : bj < i + n L } _ #{j :a j <i} n L = / z AT\ _n_ ri£ _ 
^ AT / AT A" A" \N ' n J N N ' 

if we set t = i/JV, we find that this equation becomes 

PL \ _ A((l + p)t) p L 



(3.1) Bit 



P 



For < t < 1/(1 + p), the values of A and B occurring in (3.1) are defi ned by 
interpolating linearly between points at which we have just shown that (3.1) holds, 
so it must hold for all such t. Therefore, for < t < 1/(1 + p), we have 

B 1 (t+-^-) =A'((l + p)t), 



except at isolated points of non-differentiability. All other values of B' are 1, since 
it follows immediately from the definition of B that for < t < pl/0- + p) or 
1 > t > (1 + Pl)/(1 + p), we have B'{t) = 1. 

We have aj = nA _1 (i/n), 6i = Af_B _1 (i/A r ). (Whenever we refer to A _1 (t), we 
consider it to take the smallest value possible, to avoid ambiguity, and we interpret 
S _1 (t) analogously.) When we take the logarithm and then multiply by 
double product 

v(h,...,b nL+l+nR )= n 



7 

l<i<j<n L +l+n R J 



ought to approach an integral like 



(1 + P) 2 // lo ^-^ dsdt. 



0<s<t<-^±£ 
— — i+p 

(The factor of (1 + p) 2 appears because N = {I + p)n and we rescaled by n~ 2 
instead of N~ 2 .) In the appendix, we will justify this claim rigorously, except 
with the function B replaced by a nicer function C . (The justification is not very 
difficult, but it is long enough that here it would be a distraction.) 
The conclusion from the appendix is that 

io g v(h,... bnMnR ) = 2 a 

n J J o< s <t<4±£ t - s 

where C is a certain strictly increasing, continuous, piecewise linear function satis- 
fying C(0) = 0, C(l) = (A + p)/(l + p), and \B' - C"\ = 0(1/N). Note that C has 
a continuous inverse (unlike B, which is only weakly increasing and thus may not 
even have an inverse). 

By symmetry, the integral equals 

a+# n» f » b /-w-^-c) M , 

t — s 
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We can write the integral as 



2 Jo Jo \t-s\ 

(1+P) 2 



2 

(l+p) 2 /"T# /"T-F? 



log I C - 1 (t) - C" 1 (s) | ds dt 

A+p . A+p 



/■T+J7 /-!+£ 

/ / log \t — s\ ds dt; 
Jo Jo 



since the integral being subtracted is a constant, we can ignore it. (The individual 
integrands are unbounded, but since the singularities are merely logarithmic they 
do not interfere with integrability.) Letting u = C _1 (s) and v = C _1 (t), we can 
rewrite the part that matters as 

a + p) 2 r 1 r 1 

K H> 1 1 C\u)C'(y)\og\u-v\dudv. 



10 Jo 

Since \C - B'\ = 0(1/N), this integral differs by o(l) from 

(3.2) ( l + P? j I B ' (u)B'(v) log \u-v\dudv. 

2 Jo Jo 

We will now use our formula expressing B' in terms of A! . Recall that 

for < t < 1, and that B'(t) = 1 for t £ (-pi,0) U (1, 1 + p R ). To take advantage 
of this, we change variables to s and t (which arc different from the s and t used 
earlier in the article) with u = (s + Pl)/(1 + p) and v = (t + pz,)/(l + p)- Then 
when < s,t < 1 we have B'(u) = A'(s) and B'(v) = A'(t); elsewhere B' is 1. 
Thus, (EO) is equal to 



1 



fl+PR pl+PR 

(3.3) i/ / (A'(s)+I(s))(A'(t) + I(t))log 



s - t 



1 + P 



ds dt, 



where / is the characteristic function of [— pl,0] U [1, 1 + pa], and where we set 
A 1 = outside [0, 1]. 

Removing the 1 + p from the denominator of the argument of the logarithm 
simply adds 



1+Ph rl+Pn 

/ (A' (a) + I(s)){A'(t) + I(t)) log |1 + p\ dsdt 

-PL J-PL 



to the integral; this quantity is a constant because the only occurrence of A in it is 
through the integral 

l + PR 

A'(s) ds = A(l) - A(Q) = X 

PL 

(and the square of this integral). We can also change the range of integration in 
(fO) to the entire plane (since the integrand has support only in the rectangle 
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[— pl, 1 + Pr] x [~pl, 1 + Pr])- Thus, we have arrived at the result that, for some 
irrelevant constant K, n~ 2 \ogV(b\, . . . , b nL +e+n R ) equals 

-i />oo />oo 

(3.4) -/ / {A'(s)+I(s))(A'(t)+I{t))log]s-t]dsdt + K + o(l). 

* J -oo J -oo 

We can now apply this to Theorem |2.2[ Suppose that on the fc-th line from the 
top in the a, b, c hexagon (with k < (a + c)/2), the vertical lozenges have hexagonal 

positions 1 < a\ < a 2 < ■ ■ ■ < ae < n, where n = f b + min(/c,a, c). Define the 



function A as above. Then our analysis so far, combined with Theorem 2.2, shows 
that if we take the logarithm of the number of tilings with the given behavior on the 
k-th line, and divide by n 2 , then we get the sum of two terms of the form (3.4); for, 



Theorem 2.2 gives us a product of two ^-expressions (whose exact nature depends 
on whether k lies between a and c), and when we take logarithms and divide by n 2 , 
we get two terms, each of which is half of a double integral (plus negligibly small 
terms and irrelevant constants). 

To put this into an appropriately general context, define the bilinear form (, ) by 

/•OO poo 

(3.5) (/,<?>=/ / f'(x)g'(y)log\x-y\dxdy 



for suitable functions / and g (for our purposes, functions such that their derivatives 
exist almost everywhere, are bounded, are integrable, and have compact support). 
We will now use this notation to continue the analysis begun in the previous para- 
graph. 

In this paragraph we will systematically omit additive constants and o(l) terms, 
since they would be a distraction. If k < min(a, c), then one of the two terms 
derived from Theorem ^2| is A), and the other is j{A + J , A + J ) , where 
Jo is a continuous function with derivative equal to the characteristic function of 
[— \a — k\/n, 0] U [1, 1 + |c — k\/n]. If min(o, c) < k < (a + c)/2, then one term is 
h(A + Ji, A + Ji) and the other is ^(A + J 2 , A + J 2 ), where the derivative of Ji 
is the characteristic function of [— \a — k\/n, 0] and that of Ji is the characteristic 
function of [1, 1 + |c — k\/n]. 

Now a few simple algebraic manipulations bring these results to the following 
form. 

Proposition 3.1. Letn = b+mm(k,a,c), pl = \a—k\/n, and pr = \c—k\/n. Then 
the logarithm of the number of tilings of an a, 6, c hexagon with vertical lozenges at 
hexagonal positions ai, ...,a< (and nowhere else) along the k-th line (where I = 
min(/c, a, c) and 1 < a\ < ■ ■ ■ < < n), when divided by n 2 , equals 

(A + J, A + J) + constant + o(l), 

where J is any continuous function whose derivative is half the characteristic func- 
tion of [—Pl, 0] U [1, 1 + pr], and A is defined (as earlier) by interpolating linearly 
between the values A{i/n) = : a.j < i}/n for < i < n. 

Recall that we are working in the limit as n — > oo with p^ , Pr, and A converging 
towards fixed limits. It is easy to check that the unspecified constant in Proposi- 



tion 3.1 converges as well. We would like everything to be stated in terms of the 
limiting values of pl, Pr, and A. Replacing p^ and pr in the definition of J by 
their limits changes (A + J,A + J) by only o(l), and we can increase A' by o(l) 
in such a way that ^4(0) remains 0, ^4(1) becomes the limiting value of A, and A 
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still satisfies the Lipschitz condition. Thus, we can let pl, p^, and A denote their 
limiting values from now on. 

We have now re-framed our problem. We must find a function A that maximizes 

V(A) = f (A + J, A + J) , subject to certain conditions. We will look at real- valued 
functions A on [0, 1] that are continuous, weakly increasing, and subject to the 
following constraints: A(0) — 0, A(l) — A, and A must satisfy a Lipschitz condition 
with constant 1 (so < A' < 1 where A' is defined). For convenience, define 
A(t) = for t < and A(t) = A for t > 1. Call a function A that satisfies 
these conditions admissible. Clearly, the functions A considered in this section are 
admissible. We will show in the next section that there is a unique admissible 
function A that maximizes V(A). (Notice that every admissible function A is 
differentiable almost everywhere, and A' is integrable and has compact support; 
for a proof of the necessary facts from real analysis, see Theorem 7.18 of f§. Thus, 
V(A) makes sense for every admissible A.) 



4. Analyzing the Functional 



Let T be the set of admissible functions. We can topologize T using the sup 
norm, L 1 norm, or L 2 norm on [0, 1]; it is straightforward to show that they all give 
the same topology, and that T is compact. In this section, we will show that V is a 
continuous function on !F, so it must attain a maximum. We will show furthermore 
that there is a unique function A S T such that V(A) is maximal. 

The proof will use several useful formulas for the bilinear form (, ) (formulas 



(4.1) to (4.3)). These formulas are derived in |LS|; we r epeat the d erivations here 
for completeness. One can find similar analysis in VK1 and [VK2|. 

The formulas are stated in terms of the Fourier and Hilbert transforms. For 
sufficiently well-behaved functions /, define the Fourier transform / of / by 



m- 

and the Hilbert transform / by 

J(x) 



f(t)e~ lxt dt 



fit) 
x — t 



dt 



(which we make sense of by taking the Cauchy principal value). Note that the 
Fourier transform of / is x i— ► i(sgax)f(x), and that of /' is x i— ► ixf(x); multiply- 
ing by i(sgnx) and multiplying by ix commute with each other, so differentiation 
commutes with the Hilbert transform. 

Integration by parts with respect to y in (|]^) shows that 

/oo 
f'(x)g(x)dx. 
-OO 

When done with respect to x : it also shows that 

(4.2) (f,g) = -n f(x)g'(x)dx. 



Unfortunately, Hilbert transforms are not always defined. For our purposes, it is 
enough to note that (4.1) makes sense and is true when /' and g have compact 
support, and similarly that (4.2) holds when / and g' have compact support. 
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If we set g = f and apply Parseval's identity to (4.1), we find that when / has 
compact support, 

1 f°° - 

(4-3) (/,/)= -- J Jf(x)\ 2 \x\dx. 

Thus, the bilinear form (,) is negative definite (on functions of compact support). 

We can now prove easily that V is a continuous function on T . To do so, notice 
that the definition of V and fl4.l|) imply that 

/oo 
f[{x)f 2 (x)dx, 
-oo 

where fa — A\ + + 2 J and fa = A\ — A2. Thus, since |/x( x )l — 2 for all x and 
fi=0 outside some interval I not depending on A\ and A%^ 

1/2 

MA,) - V(A 2 )\ < 2tt / |/ 2 | « ( / |/ 2 | 2 ) < ||/ 2 || 2 . 



(The second bound follows from applying the Cauchy-Schwartz inequality to \ f 2 | T.) 

It is known (see Theorem 90 of jTi)) and easy to prove (combine Parseval's 
identity with the formula for the Fourier transform of a Hilbcrt transform) that 
H/2II2 = H/2II2. Thus, |V(Ai) - V(A 2 )\ = OiWAi - A 2 \\ 2 ), so the function V is 
continuous on T . 

Because T is compact, V must attain a maximum on T . Now we apply the 
identity 

VjAi) + V(A 2 ) _ ( A x + A 2 \ t jA x ~A 2 A x - A 2 



which is a form of the polarization identity for quadratic forms. Because (Ai—A 2 )/2 



has compact support, (4.3) implies that 



V(Ai)+V(A 2 ) (Ai + A 2 



2 ~ V 2 

with equality if and only if A\ = A 2 . Thus, two different admissible functions 
could not both maximize V, since then their average would give an even larger 
value. Therefore, there is a unique admissible function that maximizes V. Let A be 
that function. (Notice that A depends on A, p^,, and pa, and hence on a, f3, and 
7, although our notation does not reflect this dependence.) 

We are now almost at the point of being able to prove that there is a function 



7^a,/3,7 such that Theorem 1.2 holds (except for the part relating Ti a ,i3^ to the 
explicitly given function V a ,p,i)- First, we need to relate A to the normalized 
average height function. 

Assume that k/(a + c) — > k as a, b, c — > oo for some k satisfying < k < 1. 
Choose normalized coordinates for the a, b, c hexagon so that the fc-th horizontal 
line from the top has normalized length 1 , and in particular coordinatize that line so 
that its left endpoint is and its right endpoint is 1; equivalently, coordinatize the 
a, /?, 7 hexagon so that the horizontal line that cuts it proportionately k of the way 



from its upper border has length 1. (The truth or falsity of Theorem 1.2 is clearly 
unaffected by our choice of coordinates.) We can then identify the scaling factor a 
with n, to within a factor of 1 + o(l). Given a tiling of the a, 6, c hexagon, if we 
scan to the right along this line, whenever we cross a vertical lozenge the normalized 
height function decreases by 2/n, and whenever we cross a location that could hold 
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a vertical lozenge but does not the normalized height function increases by 1 jn. It 
follows that the normalized height function at location t is given by 

-2A(t) + (t- A(t)) = t - 3A(t) 

plus the value at t = 0, since this function changes by the same amount as the 
normalized height function does as one moves to the right. 

Let e > 0. Then there exists a 5 > such that |V(A) - V(A)\ < S implies 
sup t \A(t) — A(t)\ < e. (This claim holds for every continuous function on a compact 
space that take s its maximal value at a unique point.) For n sufficiently large, 



Proposition 3T implies that if sup t \A(t) — A(t)\ > e, then in a random tiling, every 
behavior within o(l) of A is at least e™ ( <5 +°( 1 )) times as likely to occur along the k-th 
line as the behavior A is. Since the number of possibilities for A is only exponential 
in n, the probability that sup t | A(t) — A(t)\ > e is exponentially small in n 2 (and 
hence in a 2 ). In other words, the probability that a random height function differs 
along this line from the height function obtained from A is exponentially small 
in a 2 . It follows that the same is true without the restriction to the particular 
horizontal line, because of the Lipschitz constraint on height functions: if we show 
that large deviations are unlikely on a sufficiently dense (but finite) set of horizo ntal 
lines, then the same holds even between them. Thus, we have proved Theorem 1.2, 
except for the connection between TL a ,p,i an< ^ 

Furthermore, the density of vertical lozenges near location t along a given hori- 
zontal line is almost always approximately equal to A'(t). We can make this claim 
precise and justify it as follows. Given a random tiling, A(t) gives the number of 
vertical lozenges to the left of the location t, divided by n (plus (9(n _1 ) if t is not at 
a vertex of the underlying triangular lattice) . Thus, the number of vertical lozenges 
in an interval [a, 6] is n(A(b) — A(a)) + 0(1). We have seen that the probability 
that this quantity will differ by more than en from n(A(6) — A(a)) is exponentially 
small in n 2 . Therefore, as n — > oo (equivalently, a — ► oo), the expected value of 
A(b) — A(a) is A(b) — A(a) + o(l). Thus, we get the expected number of vertical 
lozenges in [a, b], which is also equal to the expected number of vertical lozenges in 
(a, b) (up to a negligible error). Notice that the o(l) error term is uniform in the 
choice of a, b, and the horizontal line, because the probability of a large deviation 
in height anywhere in the hexagon is small. 

If we take the result we have just proved for horizontal line segments and integrate 
it over the horizontal line segments that constitute the interior of any smooth simple 
closed curve, then we can conclude that Theorem 1.1 holds, except for the explicit 



determination of V a ,p.j (which is equivalent to the explicit determination of A, 
since V — A'). Also, notice that our method of proof implies that Tt a ,/3,-y and 
V a ,0,~[ must satisfy 

^ = 1 - 3P Qil a, 7 (x, y), 

as desired (although it does not yet determine them explicitly). Thus, all that 
remains to be done is to determine the maximizing function A explicitly. We will 
do so in Section |^. 

5. The Typical Height Function 

Unfortunately, it is not clear how to find the admissible function A that maxi- 
mizes V(A). Ordinary calculus of variations techniques will not produce admissible 



solutions. However, we will see that techniques similar to those used in [LS and 
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|VK1 , VK2{ can be used to verify that a function A maximizes V(A), if we can 



guess A. (It is not clear a priori that the techniques will work, but fortunately 
everything works out just as one would hope.) 

As we saw in Section [|, for the cases that are needed in the proof of Theorem |l.l| , 
an explicit formula for A is equivalent to one for V a ,/3.j', since we know already 
what the answer should be, guessing it will not present a problem. In Section [jj 
we tried to give some motivation by presenting a slightly simpler description of 
'PacPjj than the explicit formula. However, we do not know of any straightforward 
way to guess the answer from scratch. We arrived at it partly by analogy with the 
arctangent formula for random domino tilings of Aztec diamonds (Theorem 1 of 
|CEP[ |), partly on the basis of symmetry and simplicity, and partly on the basis of 
numerical evidence. 

To avoid unnecessarily complicated notation, we will solve the problem in greater 



generality than is needed simply for Theorem 1.1 . We will deal with the case of 
arbitrary A satisfying < A < 1, and arbitrary non- negative ph and p R (which we 
assume for simplicity are strictly greater than 0). We will use the same notation 
as earlier in the paper; for example, we set p = pi + Pr- Of course, guessing the 
admissible function that maximizes the functional in general requires additional 
effort, but the symmetry and elegance of the general formulas are a helpful guide. 

We will express the maximizing function A in terms of auxiliary functions /i and 
/ 2 . Define 

h{t) = 2t(l - t) - (A 2 + p\ - PR )t - (A 2 + P X - PL )(l - t) 

and 

f 2 (t) = (p + 2A) 2 i(l - t) - (A 2 + P X - p R ft - (A 2 + P X - PL f{\ - t). 

(Note that both expressions are invariant under (t,pL-,PR) <-> (1 — t, pr, pl); this 
observation reduces some of the work involved in verifying the claims that follow.) 
Since the discriminant of f2(t) is 

16A(1 - A)(A + PR )(X + Pi ) (A + p)(X + p + l), 

f%{t) has distinct real roots r\ < r 2 . We can show that both roots are in [0, 1] as 
follows. Since /2(C)) and / 2 (1) are at most 0, both roots of / 2 lie in [0, 1] if the point 
at which / 2 achieves its maximum does. The maximum occurs when 

(1 - A)p 2 + Xp R + 2A 2 + (2 - \)X PL + (2 + X)X PR + p L p R 
( PL + p R + 2A)2 

and this point is easily seen to lie in [0, 1]. 

We will specify the function A by specifying its derivative A', which together 
with the condition A(0) = uniquely determines the function. (We will then check 
that the newly defined A maximizes V, and is thus the same as the previous A.) 
For t G (fi,r 2 ) define 

A'(t) = icot- 1 ^ m ^ 

7T 



For t £ [0,ri] define A'(t) = lim t ^ ri+ A'(t), and similarly for t G [r 2 ,l] define 
A'(t) = lim t -,r 2 - A'(t). We can show that the first limit will be or 1 if r a S (0, 1), 
and the second will be or 1 if r 2 G (0,1); to verify this, it suffices to check 
(using resultants, for example) that /1 and / 2 cannot have a common root in (0, 1), 
from which it follows that at r\ or r 2 the denominator of the argument of the 
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arccotangent vanishes without the numerator vanishing. Notice that A'(0) = if 
/i(ri) > 0, and A'(0) = 1 if /i(n) < 0. Similarly, A'(l) = if /i(r 2 ) > 0, and 
A'(l) = 1 if /i(r2) < 0. Also, if r% — 0, then fi and f% both vanish at 0, and it 
follows that lim t ^o+ A'(i) = \\ similarly, if r% = 1 then lim t ^i_ A'(t) = i. 

Let A : [0, 1] — > [0, 1] be the unique function satisfying A(0) = with derivative 
A'. We will show later in this section that A(l) = A, from which it follows that A is 
an admissible function (since the other conditions are clearly satisfied). (We then 
extend A to a function on all of K in the usual way, so that A(t) = A for t > 1 and 
A(t) — for t < 0.) We will prove that A is the unique admissible function such 
that V(A) is maximal. 

Before beginning the proof, it is helpful for motivation to examine what the 
calculus of variations tells us about how the maximizing function should behave. 
For every admissible function A we have 

V(A) = (A + J,A+J), 

where J is any continuous function whose derivative is half the characteristic func- 
tion of [— pl, 0] U [1, 1 + pr\. Suppose we perturb our function A by adding to it a 
small bump centered at t, which we write as eSt where St is a delta function. (We 
should actually perturb by a bump rather than a spike, in order to try to maintain 
the Lipschitz constraint as long as e is small enough; however, treating St as a delta 
function gives the right answer.) Because 

(A + J + eS tl A + J + e5t) = (A + J, A + J) + 2e{S u A + J) + (9(e 2 ), 

the first variation of V is 2{5 t , A+ J). By (fb|), 

/CO 
5 t (x) {A'(x) + J'(x)j dx = -tt (A'(t) + J'(t)j . 

Thus, in order for the first variation to vanish, we must have A'(t) + J'(t) = 0. 
When A'(i) S (0, 1) this must be the case, assuming A maximizes V(A). However, 
when A'(t) € {0,1}, every perturbation violates the Lipschitz constraint, and we 
can conclude nothing. 

Our strategy for proving that A maximizes V(A) will take advantage of the 
vanishing of A'(i) + J'(t) (which we will prove directly later in this section). To 
begin, for any admissible A we write 

V(A) - V(A) = {A-A,A-A) + 2 (A — A, A + J). 

Because (A — A, A — A) < with equality if and only if A = A, in order to prove 
that V(A) < V(A), we need only show that 

2{A — A, A + J) < 0. 

To show that this is the case, we start by using (fL^), which tells us that 

/oo 
(A(t) -A(*))(A'(t) + J'(t))dt. 
-CO 

Thus, we want to show that 

/CO 
(A(t) - A(i))(A'(t) + J'(t)) dt > 0. 
-oo 

We will prove that the integrand is everywhere non-negative, by showing that 
A'(t) + J'(t) = when t e (n,r 2 ) (the interval where A'(t) e (0,1)), and that 
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in the rest of (0,1) its sign is the same as that of A(t) — A(t). (Outside (0,1), 
A(t) = A{t).) 

In order to prove these facts, we will apply the following theorem. For a proof, 
see Theorem 93 of g], p. 125]. 

Theorem 5.1. Let $ be a holomorphic function on the upper half plane, such that 
the integrals 

poo 

\$(x + iy)\ 2 dx 

exist for all y > and are bounded. Then there exists a function F on the real line 
such that for almost all x, <&{x + iy) — > F(x) as y — > 0+, and the imaginary part 
of F is the Hilbert transform of the real part of F. 

We will apply this theorem to determine the Hilbert transform of A' + J'. To 
prepare for the application of the theorem, we begin by defining, for t G {ri,r%), 



g(t) 



hit) 



and 

(5.2) $(i) = -cot^ 1 g{t). 

TT 

(Of course, $(t) = A'(t) on (r l7 r 2 ), but this new notation will help avoid confusion 
soon.) Then g extends to a unique holomorphic function on the (open) upper half 
plane. The function <E> extends as well to a unique holomorphic function on the 
upper half plane, together with all of R except the points — pl, 0, n, r-z, 1, and 
1 + Pr- To see why, notice that g(t) 2 + 1 has only four roots, in particular, simple 
roots at each of — pl, 0, 1, and 1 + pr. There is always a holomorphic branch 
of the arccotangent of a holomorphic function on a simply-connected domain, as 
long as that function does not take on the values ±i; this fact gives us the analytic 
continuation of Of course, cot7r$(i) = g(t) for all t in the upper half plane. 
For real t (except ~pl, 0, ri, r<i, 1, and 1 + Pr), define 

F(t) = lim $(t + is) = $(t). 

(We will use this notation to distinguish between the function F on the real line 
and the function 4> on the upper half plane.) In order to apply Theorem [0], we 
must determine the real and imaginary parts of F(t). Outside of (ri, r%), KeF(t) is 
piecewise constant (in particular, constant between the points where F is undefined) 
since F'(t) is imaginary there. The integrability of F'(t) at t = r\ and t = r 2 implies 
that F(t) is continuous there, which implies that KeF(t) — A'(t) for all t € (0, 1). 

To determine the behavior of Re F(t) for t $ (0, 1), we just have to see how much 
it changes by at ~pl, 0, 1, and pu, since it is constant on (— oo,— pif), (—pl,0), 
(1, 1 + pr), and (1 + pr,oo). To do so, notice that if $' has a pole with residue r 
at a point on the real axis, then F changes by —mi as one moves from the left of 
that point to its right. (To see this, integrate over a small semi-circle in the upper 
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half plane, centered at the point.) If g(u) = ±z, then 

1 , (t- u)g'(t) 
Res t=u $'(t = -lim- \ f.y 
7T t->u 1 + g(t) 2 

1 ,. (* " 

= — lim 



Trt^u (g(t) + g{u)){g{t) - g(u)) 
1 



2Trg(u) 

Therefore, if g(u) — ±i, then ReF changes by ±| from the left of u to its right. 

To determine the precise sign of g(u) when g(u) — ±i, we will need to know how 
l/vM*J behaves when analytically continued through the upper half plane. We 
know that it is positive on (ri,f2). If one analytically continues it along any path 
through the upper half plane that starts in (n, rg) and ends on the real axis to the 
left of r±, then the result is a negative imaginary number (i.e., one with argument 
—ni/2). Similarly, if the path ends to the right of r-z, then the result is a positive 
imaginary number. 

Thus, ff(-pi) = -isgn/i(-pi). In fact, g{-ph) = i, because 

M-Pl) = -A 2 - pA - pLp fl - p L - p| < 0. 

It follows that ReF increases by ^ at — pl- Similarly, ReF decreases by \ at l + p_R. 

The analysis at and 1 is slightly more subtle. We have g(0) = — isgn/i(0), 
and 5(1) = isgn/i(l). It turns out that sgn/i(0) = sgn/i(ri) and sgn/i(l) = 
sgn/i(r2). To prove this claim, we will deal with ImF. (The results about ImF 
will be needed later, so this approach is worthwhile even though one might wish 
for a direct proof.) 

It is impossible for lmF(t) to vanish for t £ (0, ri) U (r2, 1), since KeF(t) is or 
1 for such t, but F(t) cannot be or 1 (since otherwise g would have a singularity 



at t, as one can see from (5.2)). Thus, the sign of ImF(i) is constant for t in each 
of (0,ri) and (r 2 , 1). 

The imaginary part of the arccotangent does not depend on the branch used 
(since the values of the arccotangent always differ by a multiple of n). To determine 
the sign of the imaginary part of F(t), we will use the fact that for real u with 

M > i, 

(5.3) sgnlm cot _1 (m) = — sgnu. 

In order to apply this formula to F, we need to determine the sign of g on the axis. 
We determined above how 1/ yj fz(t) behaves. Since 

*(*) = ^cot- UHV 



we find, by combining the facts about 1/ y/fa (t) with (5.3), that sgnlmF(t) = 
sgn/i(t) for t e (0, ri), and sgnlmF(t) = — sgn/i(t) for t G (r 2 , 1). 

Because sgnlmF is constant, we can deduce two important facts. First, we 
see that sgn/i must be constant on (0, ri) and (f2, 1). Notice that in fact it is 
constant on [0, n] and [r2,l], because we showed earlier that it cannot vanish 
at one of the endpoints of one of these intervals unless the interval consists only 
of one point. Second, we see that sgnlmF(i) = sgn/i(n) for t £ (0,7*1) and 
sgnlmF(i) = -sgn/i(r 2 ) for t G (r 2 , 1). 
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and 
and 



Thus, having taken a short detour, we can see that g(0) — — isgn/i(ri), 
g(l) = «sgn/i(r2). It follows that ReF increases by | at iff /i(t"i) < 0, 
decreases by ^ at iff fi(ri) > 0. Notice that these are exactly the conditions 
under which A'(0) is 1 or 0, respectively. Similarly, ReF increases by \ at 1 iff 
fi( r 2) > 0, and decreases by h at 1 iff /i(r 2 ) < 0, and these are exactly the 
conditions under which A'(l) is or 1, respectively. 

The information that we have obtained determines Re F, and in fact shows that 



ReF(i) 



if i <-/>£, 

| if - p L < t < 0, 

A'(t) if < t < 1 (and t ^ n,r 2 ), 

\ if 1 < t < 1 + Pr, and 



if 1 + p R < t. 



In other words, Re F = A' + J'. 

We would like to apply Theorem 5.1 to conclude that the Hilbert transform of 
A' + J' must be ImF; for the hypotheses of the theorem to be satisfied, we need 
some integral estimates, in particular that the integrals 



|<3?(r + is)\ 2 dr 



exist for all s > and are bounded. To prove existence of the integral, we use the 
estimate $(t) — 0(i _1 ) as t — > oo. To verify this estimate, notice that as t — > oo 
in the upper half plane, g(t) = —(p/2 + A) _1 i< + 0(1), and for such t we have 

*(t) = - cot^H/?^ + \)~ 1 it + 0(1)) = l ^ t + °(^ 2 ) + fc > 

for some integer k depending on t and the branch of the arccotangent. For large 
t, continuity implies that k must be constant, and our knowledge of the behavior 
of <f> on the real axis tells us that k — 0. It follows that (f>(t) = 0(t _1 ), so 
the integrals must converge. To prove boundedness, we need only show that the 
integrals remain bounded as s — > 0. To see that they do, notice that the limiting 
integrand has singularities, but they are only logarithmic singularities (since $' has 
poles of order 1 there), so it is still integrable. 

We can now verify that A(l) = A. (This fact, which is necessary for A to be 
admissible, was stated earlier, but the proof was postponed.) To determine A(l), we 
need to integrate A' from to 1. If C denotes a semi-circle of radius R centered at 0, 
lying in the upper half plane, and oriented clockwise, then for R > max(p^, l + p#), 
Cauchy's theorem implies that 

R 



Re / $>(t)dt = Re / F(t)dt 
Jc J-r 



l + PR 

A'(t) + J' (t) dt 

PL 
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Since $(<) = t(p/2 . +A) + 0(f 



[$(t)dt = / ?(2+A) dt + OiR- 1 ) 

JC JC Trf 



<(§ + A) 



Hence, A(l) = A. 



Now Theorem 5.1 tells us that because 



F(t) = lim $(i + is) 

(except where undefined), the imaginary part of F is the Hilbert transform of the 
real part, i.e., 

A 1 + J' = hxiF. 



To complete the proof of (5.1), we need more information about how lmF(t) be- 
haves for t 6 [0, 1]. We know that ImF(i) = for t G (ri, r 2 ), by the definition of 
<I>, so for such t, 

(A(t) - A(t))(A'(t) + J'(t)) = 0. 

If we can ensure that 

(5.4) (A(t)-A(t))(AJ(t) + J'(t)) >0 



for all t G (0,r±) U (r 2 , 1), then we will have proved (5.1). 

We will deal first with the sign of A(t)— A(t). Recall that A' is constant on (0, r%), 
and is either or 1 (assuming r\ > 0). Because of the Lipschitz condition < A' < 
1, it follows that either A'(t) - A'(t) > for all t € (0,ri), or A'(t) - A'(t) < 
for all such t, according as A' is or 1 on that interval. Integrating and using 
A(0) = A(0) = implies that A(t) - A(t) > for t e (0, n) in the first case (where 
A'(0) = 0), and A(t) - A(t) < in the second (where A'(0) = 1). Similarly, if 
A'(l) = then A(t) - A(t) < for t G (r 2 , 1), and if A'(l) = 1 then A(t) - A(t) > 
for t G (7*2, 1). Therefore, to prove ( |5.4| ), we need only prove the same inequalities 
as here, with A(t) — A(t) replaced by lmF(t). 

We have already shown that sgnlmF(t) = sgn/i(ri) for t E (0,ri), and that 
sgnImF(<) = -sgn/i(r 2 ) for t G (r 2 ,l). We know as well that A'(0) = if 
/i(r x ) > and A'(0) = 1 if fi{r x ) < 0, and that similarly, A'(l) = if / x (r 2 ) > 
and A'(l) = 1 if /i(r 2 ) < 0. (Note that the only possible conditions under which r\ 
or r 2 are roots of /1 are 7*1 = and r 2 = 1, respectively.) These conditions, when 
combined with those derived in the previous paragraph, give us what we need. We 
conclude that for t G (0, n) U (r 2 , 1), we have A(t) - A(i) > iff ImF(t) > 0. Since 
ImF(i) = A'(t) + J'(t), and is for t G (r l5 r 2 ), we see that for all t G (0, 1) (and 
trivially for t g (0, 1) since then A(t) = A(t)), 

(A(i) - A(i))(A(t) + J'(i)) > 0. 



This inequality implies (5.1), which completes our proof. 

Thus, A is indeed the unique admissible function such that V(A) is maximal. 
We leave to the reader the task of checking that applying this result to the specific 



in Theorem 1.1 



functional arrived at in Proposition 3.1 leads to the explicit formula for V a ,p^ given 
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6. Conjectures and Open Questions 

The theorems we have proved do not answer all the natural questions about the 
typical plane partition in a box, or about random lozenge tilings of hexagons. 

Given a location (x,y) in the normalized a,f3,j hexagon, we can ask whether 
the probability of finding a vertical lozenge near (x,y) is given by Ta t p n (x,y). 



Theorem 1.1 tells us that this is true if we average over all (x, y) in some macroscopic 
region. However, it is conceivable that there might be small-scale fluctuations in 
the probabilities that would even out on a large scale. We believe that that is not 
the case. 

Conjecture 6.1. Let V be any open set in the a, ft, 7 hexagon containing the four 
points at which Va,/3.j is discontinuous. As a — > 00, the probability of finding a 
vertical lozenge at normalized location (x, y) $ V is given by V a ,/3,-y(x,y) + o(l), 
where the o(l) error bound is uniform in (x,y) for (x,y) ^ V. 

There is numerical evidence that Conjecture |6.l| is true. Also, the analogous 
result for random domino tilings of Aztec diamonds has been proved in ]CEP |, 



and it is not hard to prove that the local statistics for the one-dimensional case 
described in Section [j] do in fact converge to i.i.d. statistics, so it is plausible that 
Conjecture |6.l| is true. A similar result should also hold for higher-order statistics 
(probabilities of finding configurations of several lozenges) ; one can deduce explicit 
hypothetical formulas for these probabilities from the theorems and conjectures in 
ICKPfl . 

Of course, we do not need to restrict our attention to hexagons, but can look at 
tilings with lozenges of any regions. Hexagons do seem to involve the most elegant 
combinatorics and analysis, but one can prove in general that almost all tilings of 
large regions have normalized height functions that cluster around the solution to 



the problem of maximizing a certain entropy functional; see [CKP] for the details 



We also conjecture an analogue of the arctic circle theorem of Jockusch, Propp, 



and Shor. (See | JP£ | for the original proof, or [ CEP for the proof of a stronger 
version on which our conjecture is based.) Define the arctic region of a lozenge 
tiling to be the set of lozenges connected to the boundary by sequences of adjacent 
lozenges of the same orientation (where a lozenge is said to be adjacent to another 
lozenge, or to the boundary, if they share an edge). 

Conjecture 6.2 (Arctic Ellipse Conjecture). Fix e > 0. The probability that the 
boundary of the arctic region is more than a distance e (in normalized coordinates) 
from the inscribed ellipse is exponentially small in the scaling factor a. 

There are also several questions for which we do not even have conjectural an- 
swers. 

Open Question 6.3. Is there a simple way to derive the results of Section || with- 
out having to guess any formulas? 

Such a method would be much more pleasant than our approach. A good test 
case would be the following open question. 

Open Question 6.4. Is there a g-analogue of Theorem |l.l| ? 

Of course, the non-trivial situation is when q — > 1 as a — > 00 (although we do not 
know the precise relationship between q and a that will lead to interesting limiting 
behavior). There is a simple q-analogue of MacMahon's enumeration of boxed plane 
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partitions (also due to MacMahon), and a g-analogue of Proposition 2.1 (which can 



be established by a proof similar to that of Proposition 2.1). We believe that the 
answer to Open Question is yes, and that the same approach should work, but 
the guess work required is likely to be tedious. We hope that further development 
of these techniques will someday let one answer such questions more easily. 



Open Question 6.5. Is there an analogue of Theorem [l.l| for "space partitions" 
(the natural generalization of plane partitions from the plane to space)? 



The answer to Open Question 6.5 may well be yes, but it is extremely unlikely 
that similar techniques can be used to prove it. (For example, no analogue of 
MacMahon's formula is known, and there is no reason to believe that one exists.) 

Appendix. Converting the Sum to an Integral 
In Section |§|, we had to convert a sum to an integral. The sum was 

(6.i) y log fB-Hj/N)-B-Hi/N)\fi " 



^ 6 V j/N-i/N J \N 

l<Kj<n L +£+n R x ' ' ' x 

and we interpreted it as a Riemann sum for the double integral 
(6.2) // , oe C-W-C-(») dsdt| 

for some function C which we have not yet specified. In Section [| we claimed that 
the difference between the sum and the integral is o(l), and that C can be chosen 
so that B' and C nowhere differ by more than 0(1/N). In this appendix, we will 
define C and justify these claims. 

The main obstacle is that (B~ 1 )' can be quite large (infinite, in fact, when 
B' = 0). We will now define a modification C of B designed to keep (C -1 )' from 
being too large. We put C(0) = and C{bi/N) = i/N for 1 < i < n L + I + n R 
(so that C(l) = (A + p)/(l + p)). Between these points, we will define C so that it 
is a continuous, strictly increasing, piccewisc linear function on [0, 1] such that C 
is constant on intervals (i/N, (i + 1)/N), C is never smaller than 1/N 2 or greater 
than 1, and \C — B'\ = 0(1/N). There is no canonical way to do this; one way that 
works is as follows. If b i+1 > b, t + 1, then set C((b i+1 - 1)/N) = i/N + 1/N 2 (and 
similarly if bi > 1 set C((b\ — 1)/N) = 1/N 2 ). Then interpolate linearly to define 
C in between the points at which we have defined it so far. Notice that changing 
BtoC does not change the sum (f3j), since C _1 (i/JV) = h/N = B^^/N). 

To begin, for < s < t < j±£, we define 

f(s,t) = log . 

t — s 

Because (C -1 )' = 0(N 2 ), the mean value theorem implies that f(s,t) — 0(\ogN). 

Consider small squares of side length iV _1 , with their sides aligned with the 
s- and t-ax.es and their upper right corners at (s, t) = (i/N,j/N), for 1 < i < j < 
ul+ £ + nR. Each square has area N~ 2 , and / is bounded by 0(log N), so we can 
safely remove up to o(N 2 / log N) squares from the domain of integration and the 
corresponding terms from the sum without changing either by more than o(l). We 
will do so in order to restrict our attention to squares on which we can bound the 
partial derivatives of /. 
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We first remove the 0(N 3 / 2 ) squares containing some point (s, t) with \s — t\ < 

Next, we remove all squares containing some point (s,t) satisfying (C _1 )'(s) > 
iV 1 / 3 or (C _1 )'(i) > N 1 / 3 . We can check as follows that there are at most 0{N 5 / 3 ) 
such squares. Since C" 1 is increasing and has range contained in [0,1], the set 
of all t with (C _1 )'(i) > TV 1 / 3 has measure 0{N~ 1 / 3 ). Also, as (s,i) varies over 
each small square, (C _1 )'(s) and (C _1 )'(t) are constant. Hence, in this step we are 
removing at most 0(N^ 1 / 3 /N^ 2 ) = 0(N 5 ^ 3 ) squares. 

Thus, we can restrict our attention to squares containing only points (s, t) with 
\s-t\ > N- 1 ' 2 , {C- l )'{s) < N 1 ' 3 , and (C^ 1 )'^) < N 1 ' 3 . 

Now we will estimate the difference between the sum and the integral. If we can 
show that on each square, / varies by at most o(l) (uniformly for all squares), then 
we will be done. To determine how much / can vary over a square of side length 
-/V -1 , we compute 

df (c- r Y(t) 1 

dt C- 1 (t)-C- 1 (a) t-a' 

The second term has absolute value at most N 1 ' 2 (since by assumption \s — t\ > 
TV -1 / 2 ). To bound the first term, we start with the denominator. Because C < 1 
everywhere, we have (C -1 )' > 1, so |C _1 (i) — C _1 (s)| > \s — t\, and thus 



< 



C-\t)-C-^s) 
Finally, since (C _1 )'(t) < A^ 1/3 , we have 

^ = 0(N 5 / 6 ) = o(N). 

The same holds for df/ds, of course. 

Therefore, over one of the small squares of side length iV _1 , / can vary by at 



most o(l). Thus, the sum (p.l\j differs from the integral (6.2) by at most o(l) 
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